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Abstract — Epilepsy is one of the most common neurological 
disorders that greatly impair patient' daily lives. Traditional 
epileptic diagnosis relies on tedious visual screening by neurol- 
ogists from lengthy EEG recording that requires the presence 
of seizure (ictal) activities. Nowadays, there are many systems 
helping the neurologists to quickly find interesting segments 
of the lengthy signal by automatic seizure detection. However, 
we notice that it is very difficult, if not impossible, to obtain 
long-term EEG data with seizure activities for epilepsy patients 
in areas lack of medical resources and trained neurologists. 
Therefore, we propose to study automated epileptic diagnosis 
using interictal EEG data that is much easier to collect than 
ictal data. The authors are not aware of any report on automated 
EEG diagnostic system that can accurately distinguish patients' 
interictal EEG from the EEG of normal people. The research 
presented in this paper, therefore, aims to develop an automated 
diagnostic system that can use interictal EEG data to diagnose 
whether the person is epileptic. Such a system should also detect 
seizure activities for further investigation by doctors and potential 
patient monitoring. To develop such a system, we extract four 
classes of features from the EEG data and build a Probabilistic 
Neural Network (PNN) fed with these features. Leave-one-out 
cross-validation (LOO-CV) on a widely used epileptic-normal 
data set reflects an impressive 99.5% accuracy of our system 
on distinguishing normal people's EEG from patient's interictal 
EEG. We also find our system can be used in patient monitoring 
(seizure detection) and seizure focus localization, with 96.7% and 
77.5% accuracy respectively on the data set. 

Index Terms — Epilepsy, Electroencephalogram (EEG), Proba- 
bilistic Neural Network (PNN), seizure. 



I. Introduction 

EPILEPSY is a chronic disorder characterized by recurrent 
seizures, which may vary from muscle jerks to severe 
convulsions [1]. Estimated 1% of world population suffers 
from epilepsy [2], while 85% of them live in the developing 
countries [3], Electroencephalogram (EEG) is routinely used 
clinically to diagnose, monitor and localize epileptogenic 
zone [4]. Long-term EEG monitoring can provide 90% positive 
diagnostic information [5] and thus become a golden standard 
in epilepsy diagnosis. 

Traditional methods rely on experts to visually inspect 
the entire lengthy EEG recordings of up to 1 week, which 
is tedious and time-consuming. Therefore, many automated 
system assisting the diagnosis of epilepsy have emerged [6]- 
[12]. They could detect abnormal EEG segments related to 
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seizures so that doctors can quickly view events of interest 
without having to page through the entire recording [13]. 
But this approach requires the presence of seizure activities 
in the EEG data. This tough requirement often leads to 
very long, even up to 1 week, continuous EEG recording to 
capture seizure activities because of the difficulty to tell if and 
when a seizure will occur. The long-term EEG recording can 
greatly disturb patients' daily lives. Another clinical concern 
is that very unfortunetaly, 50-75% epilepsy patients in the 
world reside in areas lack of medical resources and trained 
professionals [3], which makes the long-term EEG recording 
virtually impractical to those people. Therefore, an automated 
EEG epilepsy diagnostic system would be very valuable if 
it does not require data from active seizure activities (i.e., 
ictal) to perform the diagnosis. However, to the authors' best 
knowledge, we are not aware of any report on automated 
epilepsy diagnostic system using only interictal EEG data. 

In this paper, we aim to develop an automated system that 
can diagnose epilepsy not only by using ictal EEG data but 
also interictal data. The diagnosis function of this system will 
be valuable for patients in areas lack of medical resources, and 
particularly well-trained personnels. Its capability of seizure 
detection will be the base of effective monitoring in personal 
health care and help doctors to do further diagnosis if neces- 
sary. In addition to diagnosis and seizure detection, we would 
also like the system to provide basic information on focus 
localization which is also an important aspect in diagnosis. 

Our system is a Probabilistic Neural Network (PNN) [14] 
based classifier. Previous research suggests PNN is more suit- 
able for medical applications, since it uses Bayesian strategies, 
a process familiar to medical decision makers [15]. We adopt 
PNN for its fast speed, high accuracy and real-time property 
in updating network structure, as we will explain in Sec. [IV] 
It is very difficult to directly use raw EEG data as input to an 
Artificial Neural Network [16]. Therefore, a key in designing 
the PNN classifier is to find proper features from the given 
EEG data and feed those feature values to the classifier (i.e., 
parameterized EEG data as input) . Artificial Neural Network 
has been used for automated diagnosis by several research 
groups [9]— [12]. But those work focus on seizure detection 
only. Since the interictal EEG does not have seizure activies, 
the features identified for those neural networks might not 
work for our purpose: diagnosing epilepy and localizing foci 
from interictal data. We use four classes of features, namely, 
power spectral feature, fractal dimensions, Hjorth parameters, 
and amplitude statistics. 
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Fig. 1 . Flow diagram of our EEG classification scheme 



Based on a widely used data set with epileptic and normal 
EEG data, our experiments indicate that interictal EEG of 
epileptic patients can be differentiated from those of healthy 
people with high accuracy and fast speed. Our interictal EEG 
based diagnostic approach achieves a 99.6% overall accuracy 
in cross-validation. The exisiting ictal data based strategy 
is also tested in our classifier, with 98.3% accuracty. Focus 
localization is achieved with a 78.5% accuracy. Our classifier 
is also capable of distinguishing interictal and ictal EEG and 
thus detecting seizures. The 96.9% accuracy underlines the 
possible patient monitoring. The speed of our classifier is very 
fast - 0.01 second per run in all four classification problems. 
These results imply the possibility of our system for real-time 
diagnosis, monitoring and focus localization. 



II. Data Acquisition 

In our experiments, we adopt the data set, which is widely 
used in previous epileptic diagnosis/analysis research [9], 
[12], [17], [18], from Klinik fur Epileptologie, Universitat 
Bonn, German [17]. It consists of five sets, each containing 
100 single-channel EEG segments. Each segment has 4096 
sampling points over 23.6 seconds. Note that artifacts, e.g., 
due to hand or eye movements, have been manually removed 
by the creators of the data. Data in sets A and B is extracranial 
EEG from healthy volunteers with eyes open and eyes closed 
respectively. Sets C and D are intracranial data over interictal 
period while Set E over ictal period. Segments in D are 
from within the epileptogenic zone, and those in C from 
the hippocampal formation of the opposite hemisphere of the 
brain. All EEG signals were sampled at a sampling rate of 
173.61Hz. Refer to [17] for detailed information of the data. 
The data was filtered by a low-pass filter of cutoff frequency 
40Hz. 



A. Power Spectral Features 

To a time series x\, £2, • ■ 
(FFT) X U X 2 ,--- ,X N 



,xn, its Fast Fourier Transform 
are estimated as 



n 



X k = J2*nW* 



N 



k = 1,2,--- ,N 



where W^ n 



= e « 



and N is the series length. 



We noted a clear difference that in general the ictal EEG 
has more power components in the higher frequency region 
(>14Hz) while non-ictal EEG are mostly below 14Hz. This 
point has also been described in [19]. From Fig. |2j it can be 
clearly seen that the central frequencies of peaks of different 
EEG signals lie in different regions. 



Healthy 




Interictal 



Ictal 



















































1 




> 












iU,.i 


1 


1 



Physical Frequency (Hz) 



Physical Frequency (Hz) 




Physical Frequency (Hz) 



Fig. 2. Typical FFT results of 3 EEG segments (Raw data in fiV. For FFT 
results, the scale in the Y-axis of ictal data is 10 times larger than the ones 
of healthy and interictal data) 



Based on the FFT result, Power Spectral Intensity (PSI) 
and Relative Intensity Ratio (RIR) are evaluated to each 2-Hz 
frequency band from 2-32 Hz. The PSI is defined as 



PSI k 



E 



1,2. 



15 



where f min = 2k, f max = 2k + 2, f s is the sampling 
frequency and N is the series length. As you can see, the 
fmin and fmax are the lower and upper boundaries of each 
2-Hz band, respectively. The RIR is defined as 



RIR, 



PSL 



J 



1,2, 



15 



So we have 15 PSIs and 15 RIPs. They are the first 30 features 
we used. 

The 2-32Hz band covers some EEG abnormalities unique 
to epilepsy [20], such as the 3Hz and 6Hz spike waves [21]. 



III. Feature Extraction 

Our classifier uses 38 features of 4 classes to characterize 
interictal EEG signal. The power spectral features describe 
energy distribution in the frequency domain. Fractal dimen- 
sions outline the fractal property. Hjorth parameters describe 
the chaotic behavior. Mean and standard deviation represent 
the amplitude statistics. Since normalization is very important 
to distance-based classifier, features are normalized before fed 
into PNN. 



B. Petrosian Fractal Dimension (PFD) 
PFD is defined as: 



login ^ + log 10 ( n+w ) 

where N is the series length and N$ is the number of sign 
changes in the signal derivative [22]. According to Fig. [3] (a), 
PFD is highly concentrated within each class and there is no 
overlap among the data for each class either. Therefore, all 
classes can be clearly distinguished using PFD. 
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Fig. 3. Average Petrosian Fractal Dimension (a) and Higuchi Fractal 
Dimension (b) for each set. Error bars denote one standard deviation (a) 
for each class of EEG. 



Fig. 4. Average Hjorth Mobility (a) and normalized Hjorth Complexity (b) 
for each set. Error bars denote one standard deviation (a) for each class of 
EEG. 



C. Higuchi Fractal Dimension (HFD) 

Higuchi's algorithm [23] constructs k new series from the 
original series Xi,X2, ■ ■ • ,xn by 

X m i Xm+k i X m +2k j ' ' ' , x m+ yN^_m^ k (1) 

where m = 1, 2, • • • , k. 

For each time series constructed from (Q~|), the length 
L(m, k) is computed by 

\ N — m 

T , M _ Hi=$ \x m +ik - x m+(i _i )fc |(AT - 1) 
The average length L(k) is computed as 

k 

This procedure repeats k max times for each k from 1 to 
kmax, and then uses a least-square method to determine the 
slope of the line that best fits the curve of ln(L(fc)) versus 
ln(l/fc). The slope is the Higuchi Fractal Dimension. In this 
paper, k max is 5. 

Fig. [3] (b) indicates HFD is also densely clustered within 
each class and there is a small overlap between classes C and 
E. HFD, therefore, is a good feature to characterize classes A, 
B and E. 

D. Hjorth Parameters 

To a time series X\,X2,--- ,Xjy, the Hjorth mobility and 
complexity [24] are respectively defined as 

V TP 

and 

j MA -TP 
V M2 ■ M2 

where TP = £>;/iV, M2 = T, d i/ N > MA = E( d * - 
di-i) 2 /N and di = x { - h-l 

According to Fig. [4] (a), Hjorth mobility has a tight dis- 
tributions within each class. Even though the Hjorth com- 
plexity appears very inconsistent among classes, since PNN 
uses normalized features, we compute the normalized Hjorth 



complexity and find it also has a tight distribution within each 
class. 

An interesting finding is that a set with low Hjorth mobility 
would have high normalized Hjorth complexity. For example, 
Sets C and D have the lowest mobility and highest complexity, 
which separates them from other sets. 

E. Means and standard deviations 

EEG signals from different conditions have different ampli- 
tudes. For example, the amplitude of normal activities ranges 
around 100/iV in this paper while the ictal EEG ranges around 
1000/iV. Means and standard deviations of both original data 
and the absolute values of EEG are evaluated. They are the 
last 4 features we used. 

IV. Probabilistic Neural Network 

An Al-based classifier is essentially a mapping / : W n — > 
U 1 from the feature space to the discrete class space. An 
Artificial Neural Network (ANN) implements such a map- 
ping by using a group of interconnected artificial neurons 
simulating human brain. An ANN can be trained to achieve 
expected classification results against the input and output 
information stream, so there is not a need to provide a specified 
classification algorithm. 

PNN is one kind of distance-based ANNs, using a bell- 
shape activation function. This technique makes decision 
boundaries nonlinear and hence it can approach the Bayesian 
optimal [25]. Compared with traditional back-propagation 
(BP) neural network, PNN is considered more suitable to 
medical application since it uses Bayesian strategy, a process 
familiar to medical decision makers [15]. The real-time prop- 
erty of PNN is also crucial to our research. In PNN, decision 
boundaries can be modified in real-time as new data become 
available [14]. There is no need to train the network over the 
entire data set again. So we can quickly update our network 
as more and more patients' data becomes available. 

Our PNN has three layers: the Input Layer, the Radial Basis 
Layer which evaluates distances between input vector and rows 
in weight matrix, and the Competitive Layer which determines 
the class with maximum probability to be correct. The network 
structure is illustrated in Fig. |3J using symbols and notations 
in [26]. Dimensions of arrays are marked under their names. 
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Fig. 5. PNN structure, R: number of features, Q: number of training samples, 
K: number of classes. The input vector p is presented here as a black vertical 
bar. 



A. Radial Basis Layer 

In Radial Basis Layer, the vector distances between input 
vector p and the weight vector made of each row of weight 
matrix W are calculated. Here, the vector distance is defined 
as the dot product between two vectors [25]. The dot product 
between p and the i-th row of W produces the i-th element 
of the distance vector matrix, denoted as ||W — p||. The bias 
vector b is then combined with ||W — p|| by an element- 
by-element multiplication, represented as " x" in Fig. [5] The 
result is denoted as n = ||W — p|| • xb. 

The transfer function in PNN has built into a distance 
criterion with respect to a center. In this paper, we define it as 



radbas(n) = e 



(2) 



Each element of n is substituted into (0 and produces cor- 
responding element of a, the output vector of Radial Basis 
Layer. We can represent the i-th element of a as 



a.i = radbas(\[Wi — p|| • xbj) 



(3) 



where Wj is the i-th row of W and b^ is the i-th element of 
bias vector b. 

1) Radial Basis Layer Weights: Each row of W is the 
feature vector of one trainging sample. The number of rows 
equals to the number of training samples. 

2) Radial Basis Layer Biases: All biases in radial basis 
layer are set to \/ln0.5/s resulting in radial basis functions 
that cross 0.5 at weighted inputs of ±s, where s is the 
spread constant of PNN. In this paper, s is set to 0.1, since 
our experiments show the highest accuracy is achieved when 
s = 0.1, as illustrated in Fig. [6] 

B. Competitive Layer 

There is no bias in Competitive Layer. In this layer, the vec- 
tor a is first multiplied by layer weight matrix M, producing 
an output vector d. The competitive function C produces a 1 
corresponding to the largest element of d, and 0's elsewhere. 
The index of the 1 is the class of the EEG segment. M is set 
to K x Q matrix of Q target class vectors. If the i-th sample 
in training set is of class j, then we have a 1 on the j-th row 
of i-th column of M. 



TABLE I 

Overall accuracy and classification time using PNN 



No. 


Experiment 


Accuracy 


Time (s) 


1 


normal (200 samples) 
vs. interictal (200 samples) 


99.5% 


0.01 


2 


normal (200 samples) 
vs. ictal (100 samples) 


98.3% 


0.01 


3 


interictal (200 samples) 
vs. ictal (100 samples) 


96.7% 


0.01 


4 


epileptogenic zone (100 samples) 
vs. opposite hemisphere (100 samples) 


77.5% 


0.01 



V. Experimental Results 

As shown in Table. [I] we designed four experiments to test 
the ability of our classifier to separate: 

1) normal EEG (sets A and B) and interictal EEG (sets C 
and D) 

2) normal EEG (sets A and B) and ictal EEG (set E) 

3) interictal EEG (sets C and D) and ictal EEG (set E) 

4) interictal EEG sampled from epileptogenic zone (set C) 
and interictal EEG sampled from opposite hemisphere 
(set D) 

The first two experiments evaluate the performance of our 
algorithm using interictal EEG and ictal EEG respectively. 

The last two experiments evaluate the feasibility of our 
algorithm on seizure monitoring and focus localization, re- 
spectively. 

The classifier is validated using leave-one-out cross- 
validation (LOO-CV) on 400, 300, 300 and 200 samples 
respectively in experiments 1, 2, 3 and 4. Our algorithm is 
implemented using the MATLAB Neural Network Toolbox. 
Table Q] lists the overall accuracy and classification time of 
four experiments. 

The spread constant of PNN, is seleted according to overall 
accuracy. As illustrated in Fig. [6] all experiments achieve 
the highest accuracy, when spread constant is 0.1. In our 
experiments, therefore, spread constant is set to 0.1. 




0.4 0.6 
Spread constant 



Fig. 6. Overall accuracy with respect to PNN spread constant 
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Our approach reaches 99.5% accuracy in using interictal 
EEG for epileptic diagnosis (Experiment 1). This result val- 
idates the feasibility to use only interictal data in epileptic 
diagnosis. 

The accuracy of our classifier using ictal data reaches 
98.3%, which is very similar to what has been reported from 
previous work [12]. 

In Experiment 3, 96.7% accuracy shows our system can 
distinguish ictal EEG from interictal EEG very well. This 
suggests the feasibility to continuously monitor patient status 
or detect seizures by classifying an EEG segment cut out from 
monitoring data. If the segment covering current time instant 
is classified as ictal, then the patient has been in an ictal state, 
i.e., the seizure is occurring. The focus localization experiment 
achieves a promising accuracy of 77.5%, which still needs 
further improvement. 

In all the 4 experiments, the classification time per run is 
0.01 second (on MATLAB R2008a for Linux, 1.6GHz 64- 
bit CPU, 2G RAM), which is very short compared with the 
EEG segment length, 23.6 seconds. This shows the feasibility 
of real-time monitoring. For long-term monitoring, we can 
periodically sample the EEG by a sliding window and analyze 
the windowed segment. For example, immediately after ictal 
activity detection, devices equipped with our algorithm can 
send out an alarm to healthcare providers. 

VI. Conclusions 

In this paper, an automated EEG recognition system for 
epilepsy diagnosis is developed and validated by cross- 
validation. Compared with the existing conventional seizure 
detection algorithms, our approach does not require seizure 
activity to be captured in EEG recording and thus is seizure- 
independent. This feature relieves the difficulties in EEG 
collection since interictal data is much easier to be collected 
than ictal data. 38 EEG features are extracted and PNN is 
employed to classify those features. 

Experiments indicate that interictal EEG of epileptic pa- 
tients can be differentiated from those of healthy people 
with high accuracy and fast speed. Our interictal EEG based 
diagnostic approach achieves a 99.5% overall accuracy in cross 
validation. Diagnosis based on ictal data is also tested in our 
classifier, reaching a high 98.3% accuracy. So, our algorithm 
works fine with both interictal and ictal data. We also extend 
the funtion of the classifier, to patient monitoring and focus 
localization. 96.7% accuracy is achieved on differentiating 
ictal from interictal EEG, which suggests the feasibility of 
online patient monitoring. The focus localization result is also 
promising with a 77.5% accuracy. The speed of our classifier 
is very good, costing only 0.01 second to classify an EEG 
segment of 23.6 seconds. 
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